Feeds to Scour
SubscribedAll
Scoured 18231 posts in 415.2 ms
PLA-Serve: A Prefill-Length-Aware LLM Serving System
arxiv.org·1d
🏗️LLM Infrastructure
Preview
Report Post
weitianxin/Awesome-Agentic-Reasoning
github.com·43m
🪄Prompt Engineering
Preview
Report Post
The three types of LLM workloads and how to serve them
modal.com·17h·
Discuss: Hacker News
🏗️LLM Infrastructure
Preview
Report Post
Power Aware Dynamic Reallocation For Inference
arxiv.org·1d
🏗️LLM Infrastructure
Preview
Report Post
Streamlining CUB with a Single-Call API
developer.nvidia.com·12h
🏟️Arena Allocators
Preview
Report Post
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·8h·
Discuss: Hacker News
🏗️LLM Infrastructure
Preview
Report Post
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·16h
Hardware Acceleration
Preview
Report Post
Hot 24h update on the AI x Web3 stack that’s taking the space by storm:
threadreaderapp.com·3h
🖥GPUs
Preview
Report Post
Retrieve and Rerank: Personalized Search Without Leaving Postgres
paradedb.com·1d
👤Search Personalization
Preview
Report Post
Arctic Wolf’s Liquid Clustering Architecture Tuned for Petabyte Scale
databricks.com·15h
ClickHouse
Preview
Report Post
How Pinterest Built An Async Compute Platform for Billions of Task Executions
blog.bytebytego.com·1d
⚙️Mechanical Sympathy
Preview
Report Post
A dual context-aware basecaller for nanopore direct RNA sequencing
nature.com·22h
🔤Tokenization
Preview
Report Post
Uncovering Unfaithful CoT in Deceptive Models
lesswrong.com·7h
🛡️AI Security
Preview
Report Post
Blackbox Optimization and Hyperparameter Tuning With Google's Vizier
blog.skz.dev·1d
💰Cost-Based Optimization
Preview
Report Post
Field Notes on Scaling MoE Expert Parallelism with DeepEP
nousresearch.com·1d·
🏗️LLM Infrastructure
Preview
Report Post
Bye Bye Big Tech Step 5: AI assistents and chatbots
bitsoffreedom.nl·19m
🆕New AI
Preview
Report Post
How I Rebuilt a RAG System that Actually Works
pub.towardsai.net·1d
🔄LLM RAG Pipelines
Preview
Report Post
Gathering Time Series Data
denvaar.dev·1d
🔍Feed Discovery
Preview
Report Post
a transport layer for agentic apps
ably.com·12h·
Discuss: Hacker News
💾Prompt Caching
Preview
Report Post
Tips for Using GitHub Copilot's Agent Mode
incrementsofincrements.bearblog.dev·16h
Alpine.js
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help